Description

This invention relates to apparatus and method by which broadcast information can be recognized and 
classified. More particularly, this invention relates to a system and method for classifying broadcast
information using a plurality of reference signal libraries in a two-stage classification process.

It is known that broadcast stations (television and radio) are regularly monitored to determine when and 
how often certain information is broadcast. For example, artists may be paid a royalty rate depending upon 
how often their particular work is broadcast. Likewise, commercial backers of broadcast programming have 
an interest in determining when and how often commercials are played. Further, marketing executives and
the broadcasters themselves are interested in determining the popularity of certain broadcast information in
order to target that information to the appropriate audience at the appropriate time. Those of ordinary skill in 
this field will readily understand that a wide variety of legal, economic and social concerns require the 
regular monitoring of broadcast information. All such requirements share a common need for certain 
information such as which information was broadcast and when. 

Traditionally, such broadcast station monitoring was performed manually by a plurality of listeners who
would physically monitor the broadcast program and manually tabulate which information was broadcast 
and when. However, the cost of these manual surveys has become prohibitive. Such a method is labor 
intensive and subject to reliability problems. For example, a manual monitor may easily miss a fifteen 
second commercial broadcast over radio. In addition, it is virtually impossible for a single individual to
monitor a plurality of broadcast channels. Therefore, a great number of monitors has been traditionally
required to fully monitor performance in a multi-media environment. 

In view of the above problems with manual systems, it has been proposed to design and implement an 
automatic broadcast recognition system. It is believed that such automatic systems will be less expensive 
and more reliable than manual surveys. 

In recent years, several techniques and systems have been developed which electronically monitor
broadcast signals and provide information relative to the content and timing of the program monitored. 
Initially, these automatic systems performed signal recognition by inserting a code signal in the broadcast 
signal itself. Upon reception, the automatic system would recognize the code signal (matching it with a 
reference library) and classify the broadcast information accordingly. Although such coding techniques work
for limited applications, they require allocation of portions of the broadcast signal band for identification
purposes. In addition, such a system requires special processing, coding and decoding circuitry. Such 
circuitry is expensive to design and assemble and must be placed at each transmitting and receiving 
station. In addition, those of skill in this field understand that government regulatory agencies are averse to
providing additional bandwidth for purposes of code signal identification. 

To overcome some of the disadvantages involved with the use of the coded signal techniques, certain
automatic broadcast signal identification systems have been developed which do not require special coding 
of the broadcast signal. Such a system is disclosed in U.S. Patent 3,919,479 to Moon et al. In Moon et al, 
an audio signal is digitally sampled to provide a reference signal segment which is stored in a reference 
library. Then, when the audio signal is broadcast, successive portions thereof are digitized and compared
with the reference segment in the library. The comparison is carried out in a correlation process which
produces a correlation function signal. If the reference and broadcast signal segments are not the same, a 
correlation function with a relatively small amplitude results. On the other hand, if the reference and 
broadcast signal segments are relatively the same, a large correlation function signal is produced. The 
amplitude of the correlation function signal is sensed to provide a recognition signal when the amplitude
exceeds a predetermined threshold level.

While the Moon et al system may operate effectively in certain situations, it is not effective for many 
applications. For example, where signal drop-out is experienced, a single segment correlation system may 
be severely degraded or disabled altogether. Additionally, the Moon et al system is relatively susceptible to
time-axis variations in the broadcast information itself. For example, it is known that many disc-jockeys
"compress" broadcast songs by speeding-up the drive mechanism. It is also known that other disc-jockeys
regularly "compress" and/or "stretch" broadcast information to produce certain desired effects in the 
audience. 

In an attempt to overcome such time-axis variations, Moon proposes to reduce the bandwidth of the 
broadcast signal by envelope-detecting the broadcast signal and providing envelope signals having 
substantially low, and preferably sub-audio frequency signal components. It has been found that when the
envelope signal at sub-audio frequencies is used during the correlation process, the digitally sampled 
waveforms are less sensitive to time-axis variations. However, the improvements which can be achieved by 
such a solution are very limited and will only operate for broadcast signals which have been "compressed" 
or "stretched" by a small amount. In addition, such a solution is subject to high false alarm rates. These
disadvantages make the Moon et al system less than desirable for a rapid, accurate, and inexpensive 
broadcast information recognition system. 

Another automatic signal recognition system is disclosed in U.S. Patent 4,450,531 to Kenyon et al. Mr.
Kenyon is a joint inventor of the subject application and the '531 patent. The teachings of the '531 patent
are hereby incorporated into this application by reference. 

The Kenyon et al system successfully addresses the reliability problems of a single segment correlation 
system, and the time-axis variation problems experienced by prior systems. In Kenyon et al, a plurality of 
reference signal segments are extracted from a program unit (song), digitized, Fourier transformed and
stored in a reference library in a frequency domain complex spectrum. The received broadcast signal is
then prefiltered to select a frequency portion of the audio spectrum that has stable characteristics for 
discrimination. After further filtering and conversion to a digital signal, the broadcast signal is Fourier 
transformed and subjected to a complex multiplication process with reference signal segments to obtain a 
vector product. The results of the complex multiplication process are then subjected to an inverse Fourier
transformation step to obtain a correlation function which has been transformed from the frequency to the
time domain. This correlation function is then normalized and the correlation peak for each segment is 
selected and the peak spacing is compared with segment length. Simultaneously, the RMS power of the 
segment coincident with the correlation peak segment is sensed to determine the segment power point 
pattern. Thus, Kenyon et al overcomes the disadvantages of a single segment correlation system by
providing a plurality of correlation segments and measuring the distances between correlation peaks. Where
the distances match, the broadcast signal is declared as being similar to the signal segments stored in the 
reference library. In addition, the RMS value comparison operates to confirm the classification carried out 
using the signal segments. 

To overcome the time-axis variation problem, Kenyon et al utilizes an envelope detector and band pass
filtering of the broadcast information, similar to the system of Moon et al. In addition, Kenyon et al
proposes the use of more than one sampling rate for the reference signal segments. A fast and slow
sample may be stored for each reference signal segment so that broadcast signals from faster rate stations 
will correlate with the faster rate reference segments and signals from slower rate stations will correlate with 
the slower rate reference segments. However, the system according to Kenyon et al also suffers from a
relatively high false alarm rate and is computationally very demanding. For example, performing the various
multi-segment correlations requires a great deal of computer power. Since a multitude of segments are 
sampled, the system according to Kenyon et al may take a good deal of time and require the use of 
expensive, powerful computers. 

A system for speech pattern recognition is disclosed in U.S. Patent 4,282,403 to Sakoe. Sakoe
discloses a speech recognition system in which a time sequence input of pattern feature vectors is inputted
into a reference library. The received speech signal is then subjected to spectrum analysis, sampling and 
digitalization in order to be transformed into a timed sequence of vectors representative of features of the 
speech sound at respective sampling instances. A time warping function may be used for each reference 
pattern by the use of feature vector components of a few channels. The time warping function for each
reference pattern feature vector is used to correlate the input pattern feature vector and the reference
pattern feature vector. The input pattern feature vector sequence is then compared with the reference 
pattern feature vector sequence, with reference to the warping function, in order to identify the spoken word. 
However, the Sakoe system time warps the reference patterns rather than the input signal. Thus, a plurality 
of patterns must be calculated for each reference pattern, necessarily increasing the memory and
computational requirements of the system.

A further signal recognition system is disclosed in U.S. Patent 4,432,096 to Bunge. In Bunge, sounds or 
speech signals are converted into an electrical signal and broken down into several spectrum components
in a filter bank. These components are then integrated over a short period of time to produce the short-time 
spectrum of a signal. The spectral components of the signal are applied to a number of pattern detectors
which supply an output signal only if the short-time spectrum corresponds to the pattern adjusted in the
relevant pattern detector. Each pattern detector has two threshold detectors which supply a signal if the 
applied input lies between the adjustable thresholds. Thus, the pattern detectors supply an output signal 
only if all threshold value detectors are activated. For each sound of speech, a pattern detector is provided. 
When a series of sounds is recognized, the series of addresses of the pattern detectors which have
successfully generated an output signal is stored and subsequently applied to the computer for comparison.
It can be readily appreciated that such a system requires a number of pattern detectors and a
corresponding powerful computation device. In addition, while the Bunge system uses a filter bank to 
provide a low frequency output signal which is relatively less sensitive to time-axis variations, the Bunge 
system is still subject to time distortion problems and a high false alarm rate.

Known automatic broadcast recognition systems have been caught in a quandary of choosing an 
appropriate time-bandwidth (sampling time times frequency bandwidth) product. Where the broadcast
signal is sampled with a large time-bandwidth product, signal recognition may be made accurately.
However, when a suitably large time-bandwidth product is employed, the system will be extremely sensitive to
time-axis variations. Thus, most known systems utilize a predetermined time-bandwidth product and suffer
from recognition inaccuracies and sensitivity to time-axis variations. In addition, the computational load imposed by all known
techniques severely limits the number of songs or other recordings that can be simultaneously sampled in 
real time. 

Thus, what is needed is a small, inexpensive system with limited processing power which automatically
monitors a plurality of broadcast channels simultaneously for a large number of sounds. Such a system 
should provide accurate recognition and, at the same time, remain relatively insensitive to time-axis 
variations. 

SUMMARY OF THE INVENTION

The present invention is designed with a view toward overcoming the disadvantages of known automatic 
broadcast information recognition systems while at the same time satisfying the objectives alluded to above. 
These problems and objectives are solved by the method and apparatus for classifying broadcast
information comprising the characterising features of independent claims 1 and 14 respectively.

The present inventors have discovered that an inexpensive, reliable and accurate automatic information 
classification system may be achieved by utilizing a two-stage classification process. First, known broadcast 
information (a song or commercial) is "played into" the system in order to generate first and second stage 
reference libraries. Once the libraries have been generated, broadcast information is monitored by the
system. In the first stage classification, the input signal is spectrally analyzed and filtered to provide several
low bandwidth analog channels. Each of these channels is fed to a feature generator where it is digitized to 
form a feature data set that is analyzed to determine if it matches one of the patterns in the first stage 
reference library. In a preferred embodiment the feature generator forms a multi-channel sequence by 
computing linear combinations of the input channels. Each of these feature sequences is then smoothed
using a moving average filter, further reducing the bandwidth. These reduced bandwidth sequences are
then resampled to form a feature set of very low bandwidth but long duration. These sequences are 
grouped into a spectragram and used in the first stage classification process to rule out unlikely candidates 
in the first stage reference library. In addition, the feature generator generates an additional feature 
sequence which will be used in the second stage classification process. 

Preferably each spectragram is a time/frequency matrix having a plurality of elements. Likewise, the
first stage reference patterns are also preferably time/frequency matrices having the same number of 
elements as the generated spectragram. The first stage classification process then compares the generated 
spectragram with the first stage reference spectragram. The reference spectragram may be visualized as a 
template which is "laid-over" the generated spectragram so that corresponding matrix elements match. 

Then, the difference between corresponding elements of the generated spectragram and the first stage
reference spectragram is measured to determine the similarity between the generated spectragram and the 
reference spectragram. Then, the sum of the element differences for the entire spectragram is obtained to 
provide a difference measurement between the generated spectragram and the first stage reference 
spectragram. This difference computation is repeated for each pattern in the first stage reference library. 

Songs having a difference measurement less than a threshold value are considered to be candidate
identifications. Those with difference measurements greater than a threshold value are rejected as being not 
similar to the broadcast information. 

The first stage reference patterns which have not been rejected in the first stage classification process 
are then queued according to their difference measurements. Thus, the queueing order places the most
similar first stage reference pattern at the head of the queue.
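
The first stage screening and queueing described above can be sketched as follows. This is a minimal illustration and not the implementation of the preferred embodiment: the names `spectragram_distance` and `first_stage_queue`, the dictionary layout of the library, and the use of absolute differences are all assumptions made for the example.

```python
from typing import Dict, List, Tuple

# Assumed layout: rows are frequency channels, columns are time samples.
Matrix = List[List[float]]


def spectragram_distance(generated: Matrix, reference: Matrix) -> float:
    """Sum of absolute element-wise differences between equal-sized matrices.

    Taking the absolute value of each difference is an assumption; the text
    only speaks of a sum of element differences.
    """
    if len(generated) != len(reference):
        raise ValueError("matrices must have the same number of rows")
    total = 0.0
    for gen_row, ref_row in zip(generated, reference):
        if len(gen_row) != len(ref_row):
            raise ValueError("matrices must have the same number of columns")
        total += sum(abs(g - r) for g, r in zip(gen_row, ref_row))
    return total


def first_stage_queue(generated: Matrix,
                      library: Dict[str, Matrix],
                      threshold: float) -> List[Tuple[str, float]]:
    """Return (pattern id, distance) pairs below the threshold, most similar first."""
    scored = [(pid, spectragram_distance(generated, ref))
              for pid, ref in library.items()]
    candidates = [(pid, dist) for pid, dist in scored if dist < threshold]
    # The smallest distance goes to the head of the queue.
    return sorted(candidates, key=lambda item: item[1])
```

Only the patterns returned by `first_stage_queue` would be handed to the second stage, in the order given.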

Next, the second stage reference classification process is carried out in the queueing order established 
in the first stage classification process. The second stage reference library contains a number of signal
patterns which correspond 1-to-1 with the entries of the first stage reference library. The second stage
reference patterns are queued in the order established in the first stage classification process, and then
correlated with the additional feature sequence provided from the feature generator. This additional feature
sequence does not have as low a bandwidth as the feature sequence used in the first stage classification 
process. In a preferred embodiment, a cross correlation is conducted between the additional feature 
sequence and the second stage reference patterns in the queueing order. If the peak correlation value for 
any of the cross-correlations exceeds a detection threshold, the broadcast information is classified as being
similar to the information represented by the second stage reference pattern. At this time a recognition may 
be declared and the time, date and broadcast information identification, and broadcast information source 
may be entered in a detection log. 

By performing the computationally demanding cross-correlation in the queueing order established in the
less-demanding first stage classification process, processing resources are conserved and the computer
power required to classify broadcast information is significantly reduced. 

To account for time-axis variations in the broadcast information, a preferred embodiment may include a
"time-warping" function for use in the second stage classification process. Specifically, the additional
feature sequence provided to the second stage classification process may be "compressed" and/or
"stretched" to account for variations in broadcast speed. Then, the second stage correlation process
correlates the second stage reference pattern with the unmodified additional feature sequence, with a
"compressed" additional feature sequence, and/or with a "stretched" additional feature sequence.
Therefore, proper identification can take place even if the broadcast information is broadcast more rapidly or
more slowly than intended.

BRIEF DESCRIPTION OF THE DRAWINGS

The advantageous features according to the present invention will be readily understood from the 
description of the presently preferred exemplary embodiment when taken together with the attached
drawings in which: 

FIG. 1 is a block diagram depicting the system according to the presently preferred embodiment;
FIG. 2 is a block diagram showing the filter banks of FIG. 1;
FIG. 3 depicts a series of waveforms showing the wave-shaping carried out in the filter banks of FIG. 2;
FIG. 4 is a series of waveforms showing four feature sequences generated by the feature generator
processor of FIG. 2;
FIG. 5 depicts a spectragram which is constructed with relation to the waveforms shown in FIG. 4;
FIGs. 6(a) and 6(b) depict the first stage comparison process carried out between the generated
spectragram and the first stage reference matrix;
FIG. 7 shows time-warped versions of the input waveform used in the second stage classification
process;
FIG. 8 is a series of waveforms depicting the digitized feature sequence used in the second stage
classification process, the normalized second stage reference pattern, and the cross-correlation function
therebetween;
FIG. 9 is a top-level flow chart depicting a method according to the preferred embodiment;
FIG. 10 is a flow chart of one step of FIG. 9;
FIG. 11 is a flow chart of another step of FIG. 9;
FIG. 12 is a flow chart of yet another step from FIG. 9;
FIG. 13 is a flow chart showing still another step from FIG. 9; and
FIG. 14 is a flow chart showing the confirming step according to FIG. 9.

DETAILED DESCRIPTION OF THE PRESENTLY PREFERRED EXEMPLARY EMBODIMENT 

While the present invention will be described with reference to a broadcast music classification system, 
those with skill in this field will appreciate that the teachings of this invention may be utilized in a wide
variety of signal recognition environments. For example, the present invention may be utilized with radio, 
television, data transfer and other broadcast systems. Therefore, the appended claims are to be interpreted 
as covering all such equivalent signal recognition systems. 

First, an overview of the invention will be provided for clarification purposes. Reference may be had to 
FIG. 9 for this overview. The automatic recognition of broadcast recordings is useful in determining the rate
of play and the time of play of these recordings to determine royalties, projected sales, etc. The prior art in
this area has met with limited success due to the necessity of using a relatively large time-bandwidth product to
ensure accuracy. When a suitably large time-bandwidth product is employed, most techniques experience
difficulty due to speed variations common in broadcast music stations. In addition, the computation
load imposed by these techniques limits the number of songs or other recordings that can be simultaneously
searched for in real time.

The present invention manages the large processing load imposed by a large signature data base 
through the use of an efficient, though less accurate, first (screening) stage classification process. This first 
stage eliminates from further consideration songs that are clearly different from the current input signal.
Only those patterns that match the input reasonably well in the first stage are queued for intensive scrutiny 
by the accurate but computationally demanding second stage. This is because the queueing order 
established in the first stage classification process is ranked by order of similarity to the broadcast song. 

Thus, the second stage will first consider the most likely candidates before the less likely candidates. Early
recognition will result with a corresponding decrease in computer processing power. This two-stage 
classification process results in a classification system whose overall capacity is increased by over an order 
of magnitude compared to known classification systems. 

The use of additional stages in a hierarchical classification structure could provide additional capacity
for a given processing resource. For example, much larger data bases may require a three-stage
classification process to again conserve processing power. Those of skill in this field will readily understand 
that the teachings in this invention may also be applicable to a three-or-more stage classification process. 

FIGs. 1 and 9 depict apparatus and method according to the presently preferred embodiment. The
audio signal from one or more broadcast sources is input to an audio channel receiver 4 through antenna
means 2. In the present invention, the processing structure of FIG. 1 allows simultaneous processing of up
to five audio channels. Therefore, up to five broadcast stations may be monitored and their broadcast 
programs classified. Additional hardware and software modifications could be performed to increase or 
decrease the number of channels simultaneously monitored. 

From audio channel receiver 4, the input audio signal is provided to an audio preprocessor 6. Audio
preprocessor 6 may include filter banks 8, envelope detector 10, and low pass filters 12. The audio
preprocessor performs a coarse spectral analysis by bandpass filtering the audio input into several bands. 
These bands are then envelope detected and lowpass filtered to form several low bandwidth analog 
channels. Each of these channels is then fed to a processor which performs a feature generation function. 
Specifically, the processor digitizes and further processes the low bandwidth analog channels to form a
feature data set that is analyzed to determine whether it matches one of the patterns of the first stage
classification library. In the preferred embodiment, the feature generating processor forms a four-channel 
sequence by computing linear combinations of the input channels. Each of these feature sequences is then 
smoothed using an averaging filter, further reducing the bandwidth. These reduced bandwidth sequences 
are then resampled to form a feature set of very low bandwidth but long duration. 
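
The chain of linear combination, smoothing and resampling can be sketched as below. This is a rough illustration under stated assumptions: the function names, the weight values, the window length and the resampling step are hypothetical, and resampling is shown as simple decimation of the already smoothed sequence.

```python
from typing import List, Sequence


def linear_combination(channels: Sequence[Sequence[float]],
                       weights: Sequence[float]) -> List[float]:
    """Weighted sum of the input channels, computed sample by sample."""
    if len(channels) != len(weights):
        raise ValueError("one weight per channel is required")
    length = min(len(ch) for ch in channels)
    return [sum(w * ch[n] for w, ch in zip(weights, channels))
            for n in range(length)]


def moving_average(sequence: Sequence[float], window: int) -> List[float]:
    """Average each run of `window` consecutive samples (full windows only)."""
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and len(sequence)")
    running = sum(sequence[:window])
    out = [running / window]
    for n in range(window, len(sequence)):
        # Slide the window: add the newest sample, drop the oldest.
        running += sequence[n] - sequence[n - window]
        out.append(running / window)
    return out


def decimate(sequence: Sequence[float], step: int) -> List[float]:
    """Keep every `step`-th sample of a smoothed sequence."""
    if step < 1:
        raise ValueError("step must be positive")
    return list(sequence[::step])
```

Applying `linear_combination` once per set of weights, then `moving_average` and `decimate`, would yield one low bandwidth feature sequence per weight set.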

The sequences are then grouped into a spectragram and used by the first stage classification process
to rule out unlikely candidates. In addition, a fifth sequence is generated by the feature generating 
processor for use in the second stage classification process. 

The four input channels are combined in a weighted sum to form a feature sequence with specific 
properties. In the process of forming the linear combinations, weighting coefficients are used which are
specially selected to minimize the influence of broadband impulsive energy. It has been found that this
greatly reduces sensitivity to speed variations and amplitude distortions that frequently result from the use
of compressors by broadcast stations. 

The second stage feature sequence provided by the feature generating processor is not filtered and 
resampled, but is used at a relatively large bandwidth. Since it is used at this greater bandwidth, a feature
sequence that is long enough to provide satisfactory pattern discrimination will be very sensitive to speed
variations. To counter this, the sequence is resampled at slightly different rates to form several new 
sequences that represent the input waveform at different speeds. This process is referred to hereinafter as
"time warping". A recording that is broadcast faster than normal must be expanded or stretched to replicate
the original waveform. Similarly, a recording that is broadcast slower than normal must be compressed. The
set of compressed and expanded waveforms comprises a linearly time warped set of replicas of the input
feature sequence. 
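
Linear time warping by resampling can be sketched as follows. The text does not specify the interpolation method or the speed factors, so linear interpolation, the function names and the example factors of 0.98, 1.0 and 1.02 are assumptions chosen for illustration.

```python
from typing import Dict, List, Sequence


def time_warp(sequence: Sequence[float], speed: float) -> List[float]:
    """Resample `sequence` by linear interpolation.

    A speed above 1 compresses the sequence (fewer output samples);
    a speed below 1 stretches it (more output samples).
    """
    if speed <= 0 or len(sequence) < 2:
        raise ValueError("need a positive speed and at least two samples")
    last = len(sequence) - 1
    out_len = int(last / speed) + 1
    warped = []
    for k in range(out_len):
        # Position in the input, clamped to the final sample.
        pos = min(k * speed, float(last))
        i = min(int(pos), last - 1)
        frac = pos - i
        warped.append(sequence[i] * (1.0 - frac) + sequence[i + 1] * frac)
    return warped


def warped_replicas(sequence: Sequence[float],
                    speeds: Sequence[float] = (0.98, 1.0, 1.02)) -> Dict[float, List[float]]:
    """Build one replica per hypothetical speed factor, keyed by that factor."""
    return {speed: time_warp(sequence, speed) for speed in speeds}
```

A recording broadcast faster than normal would be matched by a replica produced with a speed below 1, which stretches the input back toward its original duration.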

The first classification stage operates on the low bandwidth spectragrams, treating them as
time-frequency matrices of the most recent feature data. These spectragrams should be normalized to
compensate for gain variations in the broadcast and receive systems. 

In the first stage classification process, the most recently generated spectragram is compared with
every reference pattern stored in the first stage reference library. For each first stage reference pattern in 
the library, a distance is computed between it and the input spectragram. This distance may be computed 
as the sum of differences between corresponding elements of the input and reference matrices. This 
distance is a measure of the similarity between the current broadcast song and the subject reference
recording. This distance computation is repeated for each pattern in the data base. Songs whose distances
are less than a threshold value are considered to be candidate identifications. Those with distances greater 
than the threshold value are rejected. 

To conserve processing resources and ensure that the most likely candidates are considered first, the
first stage classifications are positioned in a queue according to their distance scores. Patterns that have
passed the first stage classification test and entered into the queue are subject to a confirming classification 
in the second stage. This classification process uses the single channel wider bandwidth feature set, 
including the time warped replicas. For each entry in the queue, the corresponding single channel reference
pattern is compared with each of the time warped replicas of the most recent feature vector. A correlation
procedure is employed that involves computing the cross correlation function and then scanning it to select 
the maximum value. This is repeated for each of the time warped replicas. If the peak correlation value for 
any of the correlations exceeds a detection threshold, a recognition is declared and the time, date, song 
identification number, and radio station are entered in a detection log. If none of the songs in the queue
passes the confirming classification, the next time segment is analyzed in the same way. In such a fashion,
an inexpensive, efficient and accurate broadcast music classification system is realized. Therefore, a small 
system with limited processing power can monitor several radio channels simultaneously for a large number 
of songs. This large capacity has an economic advantage in that the revenue producing capability of a 
single monitoring unit is proportional to the number of songs monitored times the number of stations under
surveillance.
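
The confirming classification can be sketched as a loop over the queue and over the time warped replicas. The sketch below is illustrative only: the function names are hypothetical, and the normalization of each window to zero mean and unit energy is an assumption made so that a single detection threshold between 0 and 1 is meaningful.

```python
import math
from typing import Dict, List, Optional, Sequence


def normalize(seq: Sequence[float]) -> List[float]:
    """Remove the mean and scale to unit energy (constant input becomes zeros)."""
    mean = sum(seq) / len(seq)
    centered = [x - mean for x in seq]
    energy = math.sqrt(sum(x * x for x in centered))
    return [x / energy for x in centered] if energy > 0 else centered


def peak_cross_correlation(feature: Sequence[float],
                           reference: Sequence[float]) -> float:
    """Slide `reference` along `feature` and return the largest correlation value."""
    ref = normalize(reference)
    span = len(ref)
    if len(feature) < span:
        return 0.0
    peak = -1.0
    for lag in range(len(feature) - span + 1):
        window = normalize(feature[lag:lag + span])
        peak = max(peak, sum(w * r for w, r in zip(window, ref)))
    return peak


def second_stage(queue: Sequence[str],
                 replicas: Sequence[Sequence[float]],
                 references: Dict[str, Sequence[float]],
                 threshold: float) -> Optional[str]:
    """Return the first queued pattern id whose peak exceeds the threshold."""
    for pattern_id in queue:  # most similar first stage candidate first
        reference = references[pattern_id]
        for replica in replicas:  # unmodified, compressed and stretched inputs
            if peak_cross_correlation(replica, reference) > threshold:
                return pattern_id
    return None  # no recognition; the next time segment would be analyzed
```

A caller that receives a pattern id would then write the time, date, song identification number and radio station to the detection log.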

The first stage features have a reduced dimensionality and can be computed and evaluated at low 
computation cost. Nevertheless, the effect of the first stage processing is to reduce the number of song 
candidates to a small fraction of the data base size. The second stage of classification uses a song 
signature of significantly higher dimensionality (time-bandwidth product) than the first stage, makes song
detection decisions only for songs queued in the first stage, and only for those songs which are identified
as probable for the first stage. The first stage has a song detection threshold bias towards high detection 
rates and moderate false alarm rates. The second stage has both high detection rates and low false alarm 
rates. The net effect of this two stage detection procedure is the ability to monitor a large number of songs 
over several channels using only limited processing power. Thus, the apparatus and method according to
the present invention may provide an economically significant broadcast information classification system.

Turning now to FIG. 1, the apparatus according to the present invention will be described. Antenna 2 
receives radio waves including audio signals. The antenna apparatus is capable of receiving up to five radio 
channels simultaneously. The audio signal is received by audio channel receiver 4, and provided to audio 
preprocessor 6 through MULTIBUS (TM of Intel Corp.) 100. Audio preprocessor 6 includes filter banks 8,
envelope detectors 10, and low pass filters 12. The audio preprocessor 6 will be described in more detail
with reference to FIG. 2. 

FIG. 1 also depicts analog-to-digital converter 14 which may be used to digitize the audio signal. 
Multiplexer 16 is used to carry out multiplexing operations when a plurality of audio signals is being 
simultaneously classified. Both A/D converter 14 and multiplexer 16 are also coupled to MULTIBUS (TM of 
Intel Corp.) 100.

Also coupled to MULTIBUS (TM of Intel Corp.) 100 is an array processor 18. Array processor 18 
comprises a Marinco 3024 CPU 20 and a feature vector operations section 22. Both the feature vector 
operations section 22 and the CPU 20 are coupled to MULTIBUS (TM of Intel Corp.) 100. The functions of 
array processor 18 include the time warping of the second stage feature sequence and the second stage
correlation computations.

The processor 24 is also coupled to MULTIBUS (TM of Intel Corp.) 100 and performs the functions of 
control, data-base management, all input/output (I/O) management, and the first stage classification calculations.
Processor 24 may include a microprocessor 26, a memory 28, I/O interfaces 30, a real time clock 32, 
reference pattern memory 34 and off-line memory 36. Preferably, microprocessor 26 may be a Motorola
68000 series microprocessor. Preferably, working memory 28 includes two Megabytes of memory. Likewise
pattern memory 34 stores both the first stage and second stage reference libraries and preferably is 
realized by a 10 Megabyte hard disk. The off-line memory 36 may be used to change/add/delete reference 
patterns from the reference pattern libraries in memory 34. Preferably off-line memory 36 comprises a one 
Megabit floppy disk. 

Finally, the processing system may be coupled with such peripherals as CRT 38, printer 40, and
terminal 42. Such peripherals are coupled to the system through I/O interfaces 30. 

Turning now to FIG. 2, the coarse spectral analysis step S100 of FIG. 9 will be described. The received
audio signal is provided to audio preprocessor 6 where it is divided into a plurality of channels. In the
presently preferred embodiment, four channels have been selected. However, greater or fewer channels
may be used depending upon the exact type of information which is to be classified. Each channel includes
a bandpass filter 8, a rectifier 10, and a low pass filter 12. The purpose of the audio preprocessor is to
reduce the amount of information processed in the first stage. This provides a long term averaging of the
first stage features. Since the purpose of the first stage is to reduce the computation required for
recognition, it is desirable to reduce the amount of information processed per unit time. Signal
discrimination accuracy is proportional to the time bandwidth product of the feature vector. Therefore, by
reducing feature vector bandwidth while expanding duration, accuracy is maintained while required
processing per unit time is decreased. This is true for any process that requires continuous searching for
time series events.

In order to accomplish this, the audio input signal depicted in FIG. 3 is provided to each of bandpass
filters 8. Each bandpass filter outputs a signal depicted in FIG. 3 as the bandpass filtered signal. The filtered
signals are provided to rectifiers 10, each of which outputs a waveform shown in FIG. 3. Finally, the rectified
signals are provided to lowpass filters 12, each of which outputs a lowpass filtered signal, as depicted in
FIG. 3. By sampling the reduced bandwidth signal, processing time is conserved while simultaneously
reducing the sensitivity of the system to speed variations in the audio signal. Lowpass filters 12 therefore
provide a plurality of waveforms as depicted in FIG. 4. These waveforms are respectively denoted
by $X_1(t)$, $X_2(t)$, $X_3(t)$, and $X_4(t)$. Each of these waveforms is provided to processor 24 which generates
feature sequences according to the waveforms.
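The bandpass, rectify, and lowpass chain of one preprocessor channel may be sketched as follows. This is a minimal illustration only: the specification does not give filter designs or band edges, so the first-order filters, the `BANDS_HZ` values, and the 2 Hz envelope cutoff are all assumptions chosen for the example.

```python
import math

def one_pole_lowpass(samples, cutoff_hz, rate_hz):
    """First-order IIR lowpass (an assumed stand-in for the unspecified filters)."""
    alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / rate_hz)
    out, state = [], 0.0
    for sample in samples:
        state += alpha * (sample - state)
        out.append(state)
    return out

def crude_bandpass(samples, low_hz, high_hz, rate_hz):
    """Difference of two lowpass outputs; passes roughly low_hz to high_hz."""
    upper = one_pole_lowpass(samples, high_hz, rate_hz)
    lower = one_pole_lowpass(samples, low_hz, rate_hz)
    return [u - l for u, l in zip(upper, lower)]

def channel_envelope(samples, low_hz, high_hz, rate_hz, envelope_hz=2.0):
    """One channel: bandpass filter 8, rectifier 10, lowpass filter 12."""
    band = crude_bandpass(samples, low_hz, high_hz, rate_hz)
    rectified = [abs(value) for value in band]
    return one_pole_lowpass(rectified, envelope_hz, rate_hz)

# Hypothetical band edges for the four channels (not taken from the specification).
BANDS_HZ = [(100, 400), (400, 1000), (1000, 2500), (2500, 5000)]

def preprocess(samples, rate_hz):
    """Return the four slowly varying envelope waveforms, one per channel."""
    return [channel_envelope(samples, lo, hi, rate_hz) for lo, hi in BANDS_HZ]

# Usage: a 700 Hz test tone produces its largest envelope in the 400-1000 Hz channel.
tone = [math.sin(2.0 * math.pi * 700.0 * n / 16000.0) for n in range(4000)]
envelopes = preprocess(tone, 16000.0)
```

Because each output is an envelope with a bandwidth of a few hertz, it can be sampled at a very low rate, which is the reduction in processing per unit time that the text describes.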

Processor 24 thus provides a plurality of feature sequences denoted by $X_{S1}(t)$, $X_{S2}(t)$, $X_{S3}(t)$,
$X_{S4}(t)$ and $X_c(t)$. Each of these feature sequences is formed as a linear combination of all four waveforms
$X_1(t)$ through $X_4(t)$. As shown in FIG. 4, at a time $t_1$ the four waveforms are sampled and amplitude
voltages $V_{A1}$, $V_{B1}$, $V_{C1}$ and $V_{D1}$ are respectively measured. Then, for time $t_1$ a feature vector may be
calculated for each of the waveforms. The feature vector is a series of numbers describing characteristics of
the input signal. In the preferred embodiment, a feature vector for waveform $X_{S1}(t)$ at time $t_1$ may be
calculated as follows:

$$X_{S1}(t_1) = K_1 V_{A1} + K_2 V_{B1} + K_3 V_{C1} + K_4 V_{D1} \qquad (1)$$

Thus, each sequence of feature vectors includes components from each of the four waveform bands. The
coefficients K are selected to specifically suppress noise.
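As a small worked illustration of equation (1), the feature value at one sample time is a weighted sum of the four band amplitudes. The amplitudes and coefficients below are invented for the example; the specification does not list numerical values for the coefficients K.

```python
def feature_value(band_amplitudes, coefficients):
    """Equation (1): weighted sum of the band amplitudes at one sample time."""
    if len(band_amplitudes) != len(coefficients):
        raise ValueError("one coefficient per band is required")
    return sum(k * v for k, v in zip(coefficients, band_amplitudes))

# Hypothetical amplitudes V_A1..V_D1 and coefficients K_1..K_4.
V = [0.8, 0.5, 0.3, 0.1]
K = [1.0, -0.5, 0.25, 0.0]
x_s1 = feature_value(V, K)  # 0.8 - 0.25 + 0.075 + 0.0
```

Each of the feature sequences would use its own coefficient set, so that every sequence draws on all four bands.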

The special selection of coefficients K is used to suppress the effects of amplitude distortion in the
broadcast signal. Amplitude distortion (sometimes denoted as "amplitude compression") is sometimes
intentionally applied by certain broadcast stations to avoid overdriving inexpensive receivers. Such
"amplitude compression" degrades the similarity of a stored reference pattern to that computed from the
input radio signal. For a given level of detection reliability, this requires larger reference patterns to be
used than would be necessary if the distortion were not present. The need for large reference patterns
causes a reduction in processing efficiency, particularly making it difficult to employ an effective first stage
which makes preliminary decisions using song signatures of low dimensionality. The newly developed
approach overcomes this distortion problem by taking explicit advantage of the spectral properties of this
distortion.

The "amplitude compression" process does not significantly affect narrowband signal components, but
primarily affects impulsive components which are wideband in nature. A multi-channel time series consisting
of a frequency based time series before compression will be denoted as $f_i(t_j)$. After compression, each band
time series becomes:

$$g_i(t_j) = a_i(t_j) + f_i(t_j) \qquad (2)$$

where compression is described as an additive component $a_i(t_j)$ to each band. In the implemented
approach to suppressing effects of amplitude compression, the additive components $a_i(t_j)$ are assumed to
be linearly related. Thus, it is assumed that there exists a linear equation which approximately estimates
each $a_i(t_j)$ based on the values of $a_k(t_j)$ for $k \neq i$. Thus,

$$a_i(t_j) = \sum_{k \neq i} b_{ik}\, a_k(t_j) + e_i(t_j) \qquad (3)$$

where $e_i(t_j)$ is the estimation residual for time $t_j$.

If the $f_i$ may be approximated as independent (which they are not), then the coefficients $b_{ik}$ may be
estimated by the correlation process from a data epoch covering time $(t_1, t_M)$ through the solution of the set
of (N-1) linear equations:

$$(a_i, a_k) - \sum_{k' \neq i} b_{ik'}\,(a_{k'}, a_k) \approx 0 \qquad (4)$$

for each $k \neq i$, where

$$(a_k, a_{k'}) = \sum_{j=1}^{M} a_k(t_j)\, a_{k'}(t_j) \qquad (5)$$

Since the $\{f_k\}$ are approximated as independent, it follows that $(a_k, a_{k'}) = (g_k, g_{k'})$ for $k' \neq k$.
This estimate of $(a_k, a_{k'})$ is most accurate when the $a_k(t_j)$ take on their largest magnitude in comparison
with the $f_k(t_j)$. This occurs when amplitude compression is the greatest. Thus, the estimate is made:

$$(a_k, a_{k'}) = (g_k, g_{k'})' \qquad (6)$$

where

$$(g_k, g_{k'})' = \sum_{\{j \,:\, |g(t_j)| > T\}} g_k(t_j)\, g_{k'}(t_j) \qquad (7)$$

Here $|g(t_j)|$ is a measure of the magnitude of the received broadcast signal and T is a selected
threshold above which the signal is considered to be heavily compressed.

The set of linear equations is solved for estimates of the $b_{ik}$. Then, the effect of compression is
suppressed by replacing received energy band time series $g_i(t_j)$ by:

$$g'_i(t_j) = g_i(t_j) - \sum_{k \neq i} b_{ik}\, g_k(t_j) \qquad (8)$$

To a linear approximation, the N time series $g'_i(t_j)$ have had the effect of compression removed. This
approach suppresses linearly dependent information between energy bands and emphasizes linearly
independent information. The linearly dependent information can be added, for improved recognition
purposes, but must be downweighted because of its vulnerability to amplitude compression.

What is achieved by this method is a set of $g'_i(t_j)$ which are relatively immune to the effects of
compression. The coefficients $b_{ik}$ may be estimated using data from an ensemble of broadcast music and
transmitting stations so that they do not have to be re-estimated for each broadcast station, and so that they
are independent of the music being transmitted.
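The estimation and suppression procedure may be sketched as follows, under stated assumptions. The sketch takes $|g(t_j)|$ to be the sum of the absolute band values, which the specification leaves open, and solves each set of (N-1) equations by plain Gaussian elimination; the function names are hypothetical.

```python
import math

def solve_linear(matrix, rhs):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(rhs)
    aug = [row[:] + [rhs[r]] for r, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("band correlations are linearly dependent")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x

def estimate_coefficients(bands, threshold):
    """Estimate b[i][k] from samples whose magnitude exceeds the threshold T."""
    n, m = len(bands), len(bands[0])
    # Assumed magnitude measure |g(t_j)|: sum of absolute band values.
    loud = [j for j in range(m) if sum(abs(b[j]) for b in bands) > threshold]

    def inner(k, kp):  # thresholded inner product, as in equation (7)
        return sum(bands[k][j] * bands[kp][j] for j in loud)

    b = [[0.0] * n for _ in range(n)]
    for i in range(n):
        others = [k for k in range(n) if k != i]
        gram = [[inner(kp, k) for kp in others] for k in others]
        rhs = [inner(i, k) for k in others]
        for k, value in zip(others, solve_linear(gram, rhs)):
            b[i][k] = value
    return b

def suppress_compression(bands, b):
    """Subtract from each band its linear estimate from the other bands."""
    n, m = len(bands), len(bands[0])
    return [[bands[i][j] - sum(b[i][k] * bands[k][j] for k in range(n) if k != i)
             for j in range(m)] for i in range(n)]

# Usage with three synthetic bands; a negative threshold keeps every sample.
demo = [[math.sin(0.7 * j) for j in range(20)],
        [math.cos(1.3 * j) for j in range(20)],
        [math.sin(0.7 * j) + 0.5 * math.cos(1.3 * j) + 0.2 * math.sin(2.9 * j)
         for j in range(20)]]
coeffs = estimate_coefficients(demo, threshold=-1.0)
cleaned = suppress_compression(demo, coeffs)
```

When every sample is used for the estimate, each cleaned band is orthogonal to the other original bands, which is the sense in which linearly dependent information between bands is suppressed.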

Sampling of the waveforms in FIGURE 4 is preferably conducted at a rate of eight times per second.
The bandwidth is preferably H Hz. The Nyquist sampling theorem indicates that the sampling rate should be
approximately 4 Hz. The present inventors have chosen a sampling rate of 8 Hz in order to ensure greater
accuracy.

Referring now to FIGURE 5, it can be seen that processor 24 constructs a spectragram in accordance
with the linear combinations of the waveforms of FIGURE 4. Thus, as shown in FIGURE 5, each block
contains data integrated over eight seconds of time. Thus, the spectragram is a matrix having four spectral
channels and eight time channels. Each matrix element contains a feature component calculated as
described above. The spectragram is computed as indicated in step S110 of FIGURE 9. This will be
described in more detail with reference to FIGURE 10.

According to FIGURE 10, the feature data sets determined in step S100 are smoothed using a moving
average filter, as indicated in step S101. Next, at step S103, the waveforms are resampled to form a low
bandwidth multi-channel time series. This has the further advantage of reducing the sensitivity to speed
variations. Finally, at step S105 the time/frequency matrix of the most recent sample is formed, as depicted
in FIGURE 5.
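Steps S101 through S105 may be sketched as follows. The sketch assumes an 8 Hz input rate, a 64-sample (eight second) averaging window, and one retained value per window; these numbers are inferred from the eight-second blocks and the four-by-eight matrix described above and are not stated as such in the specification.

```python
def moving_average(series, window):
    """Step S101: causal moving average over the last `window` samples."""
    out, running = [], 0.0
    for idx, value in enumerate(series):
        running += value
        if idx >= window:
            running -= series[idx - window]
        out.append(running / min(idx + 1, window))
    return out

def first_stage_matrix(channels, window=64, step=64, time_slots=8):
    """Steps S103 and S105: resample each channel and keep the newest slots."""
    matrix = []
    for channel in channels:
        smoothed = moving_average(channel, window)
        resampled = smoothed[step - 1::step]      # one value per block
        if len(resampled) < time_slots:
            raise ValueError("not enough samples for a full matrix")
        matrix.append(resampled[-time_slots:])    # most recent time channels
    return matrix

# Usage: 512 samples per channel (64 s at 8 Hz) give a 4 x 8 matrix.
ramp = [float(j // 64) for j in range(512)]
matrix = first_stage_matrix([ramp, ramp, ramp, ramp])
```

With the window equal to the resampling step, each matrix element is simply the mean of one eight-second block.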

Once the first stage spectragram has been generated, the spectragrams should be normalized to
compensate for gain variations in the broadcast and receive systems. This step is depicted as S130 in
FIGURE 9, and is further illustrated in FIGURE 11. To accomplish the normalization, all elements of the
input spectragram are summed in step S111. This sum represents the total energy in the spectragram.
Then, at step S113, each element of the spectragram is divided by the spectragram sum. This produces a
spectragram with unit energy for easy comparison with the reference pattern.
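The normalization of steps S111 and S113 amounts to dividing every element by the sum of all elements. A minimal sketch, with a hypothetical function name:

```python
def normalize_spectragram(matrix):
    """Steps S111 and S113: divide each element by the sum of all elements."""
    total = sum(sum(row) for row in matrix)
    if total == 0:
        raise ValueError("spectragram has no energy to normalize")
    return [[value / total for value in row] for row in matrix]

# Usage: a 2 x 2 example whose elements sum to 8.
unit = normalize_spectragram([[1.0, 3.0], [2.0, 2.0]])
```

After this step two spectragrams that differ only by a gain factor become identical, which is what makes the element-wise comparison meaningful.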

After normalization, the input spectragram and the reference spectragram from the first stage reference
library are compared in a preliminary classification step S150, as shown in FIGURES 9 and 13. Each
element of the input and reference spectragrams preferably includes 16 bits representing the value of the
matrix element. As visually depicted in FIGURES 6(a) and 6(b), the first stage comparison is merely a
matter of matching the input signal spectragram with the reference matrix. This process can be visualized
as overlaying a reference template on the input signal spectragram.

Since each of the signal and reference matrices contains the same number of elements, a 1-to-1
comparison between matrix elements is conducted. As shown in FIGURES 6(a) and 6(b), the value of matrix
element $X_{S1,1}$ is compared with matrix element $X_{R1,1}$. This may be visualized as comparing the distances
between the two element values. Returning to FIGURE 12, the distance between the input spectragram and
each reference matrix is determined in step S131. This is accomplished by summing the differences between
corresponding time/frequency elements of the signal spectragram and the reference matrix. This distance
calculation is carried out for each of the entries in the first stage reference library.

Next, at step S133, all first stage reference matrices whose distance measurements are less than a 
predetermined threshold are accepted as likely candidates. Those first stage reference matrices whose 
distance measurements exceed the threshold are rejected as unlikely candidates. 
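Steps S131 and S133 may be sketched as follows. The text speaks only of "summing the differences"; the sketch assumes absolute differences so that positive and negative deviations cannot cancel, and the library layout (a dictionary keyed by song number) is hypothetical.

```python
def matrix_distance(signal, reference):
    """Step S131: sum of absolute element-wise differences (assumed metric)."""
    return sum(abs(s - r)
               for signal_row, reference_row in zip(signal, reference)
               for s, r in zip(signal_row, reference_row))

def likely_candidates(signal, library, threshold):
    """Step S133: keep references whose distance is below the threshold."""
    scores = {song: matrix_distance(signal, ref) for song, ref in library.items()}
    return {song: dist for song, dist in scores.items() if dist < threshold}

# Usage with a toy 1 x 2 spectragram and three hypothetical references.
library = {1: [[0.5, 0.5]], 2: [[0.25, 0.75]], 3: [[1.0, 0.0]]}
accepted = likely_candidates([[0.5, 0.5]], library, threshold=0.75)
```

Only the accepted references go on to the sorting and queueing step, so the threshold directly controls how much second stage work remains.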

Once a distance measurement has been calculated for each matrix in the first stage reference library,
those songs that are identified as likely candidates are subjected to a sort and queueing step S170, as
depicted in FIGURES 9 and 13. As discussed above, by queueing the songs in their order of similarity to
the input signal, the computationally demanding second stage classification will be greatly abbreviated. It
should be noted that a wide variety of sort and queueing procedures are available for carrying out this step.
The inventors have decided to utilize the sorting and queueing procedure depicted in FIGURE 13.

At step S151 of FIGURE 13, the distance value for each queue entry is set to a maximum. Next, for
each song whose distance measurement is less than the threshold value, a queue entry is generated
containing the song number and its distance score, as shown in step S153. Then, for each new entry into
the queue, the queue is scanned from the end to locate the rank order position for the new entry. The new
entry is then inserted into the queue at the appropriate position. Entries having a larger distance than the
new entry will then be moved toward the end of the queue. This process is depicted in step S155. Lastly, in
step S157, the array processor is directed to process songs in the queue in ascending order of distance
measurements. Thus, a reference spectragram which has a low distance value from the input song will be
subjected to the second stage classification before a reference spectragram having a higher distance
measurement.
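The rank-order insertion of steps S153 and S155 may be sketched as follows. The sketch uses a growable Python list instead of the fixed-size queue preset to a maximum distance in step S151, which is a simplification; the names are hypothetical.

```python
def insert_ranked(queue, song, distance):
    """Step S155: scan from the end and insert at the rank-order position."""
    position = len(queue)
    while position > 0 and queue[position - 1][1] > distance:
        position -= 1                      # larger distances shift toward the end
    queue.insert(position, (song, distance))

def build_queue(scores, threshold):
    """Step S153: queue only songs whose distance is below the threshold."""
    queue = []
    for song, distance in scores.items():
        if distance < threshold:
            insert_ranked(queue, song, distance)
    return queue

# Usage: song 9 is rejected, the rest come out in ascending order of distance.
queue = build_queue({7: 0.4, 3: 0.1, 9: 0.9, 5: 0.25}, threshold=0.5)
```

Scanning from the end is cheap when new entries tend to be poor matches, since such entries settle near the tail after only a few comparisons.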

The above-described procedures complete the first stage classification process. The first stage
produces a queue ordered by similarity to the input song. This queue order will be used in the second
stage classification process to compare the most likely reference songs to the input song.

Second stage classification actually begins with the generation of the wider bandwidth feature sequence
$X_c(t)$, as depicted in FIGURE 2. As discussed above, it is necessary to "time warp" the second stage
feature sequence in order to account for speed variations in the broadcast signal. For purposes of the
preferred embodiment, it is assumed that all such speed variations are constant, and thus the time warping
feature of the preferred embodiment is a linear time warp.

Radio stations are known to introduce significant speed variations into recordings that are broadcast.
For a feature vector with a sufficient time-bandwidth product to provide near error free recognition, most
recognition systems are intolerant of these speed variations. This problem has been addressed in the
Kenyon et al patent referred to above through the use of a segmented correlation approach. In this
approach, short feature vectors with relatively small time-bandwidth products were identified separately. The
final recognition decision was based on the timing of individual segment recognitions. While this procedure
allowed recognition of songs with substantial speed variation, it did not take full advantage of the fact that
these speed differences introduce a linear time base error. The method according to the present invention
is to linearly compress or expand the time base of the input feature sequence until the distortion has been
removed. The entire feature vector is then compared with undistorted vectors from the reference set. The
compression and expansion of the feature vector is performed by resampling and interpolating the input
sequence. This linear time warping is done in small enough increments that at least one increment will
match the stored reference with essentially no degradation. Such time warping is depicted in FIGURE 7.

As shown in FIGURE 7, the input waveform can be compressed into a compressed waveform, and/or
stretched into an expanded waveform. According to the preferred embodiment, a set of four time warped
waveforms is provided in addition to the un-warped waveform. For purposes of broadcast music
recognition, applicants have chosen to provide waveforms compressed by 2% and 4%, and waveforms
stretched by 2% and 4%. Thus, two compressed waveforms, two stretched waveforms, and the original
waveform are provided for comparison to the second stage reference library.
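Linear time warping by resampling and interpolation may be sketched as follows. The sketch uses linear interpolation and lets the output length change with the warp factor; the specification does not say which interpolator is used or whether the warped sequences are trimmed to a common length, so both choices are assumptions.

```python
def linear_time_warp(samples, factor):
    """Resample so the time base is scaled by `factor` (>1 stretches, <1 compresses)."""
    n_out = int(round(len(samples) * factor))
    warped = []
    for m in range(n_out):
        position = m / factor              # fractional index into the input
        index = int(position)
        if index >= len(samples) - 1:
            warped.append(samples[-1])     # hold the last sample at the edge
        else:
            frac = position - index
            warped.append(samples[index] * (1.0 - frac) + samples[index + 1] * frac)
    return warped

# The five replicas of the preferred embodiment: 4% and 2% compressed,
# un-warped, and 2% and 4% stretched.
WARP_FACTORS = [0.96, 0.98, 1.0, 1.02, 1.04]

def warped_set(samples):
    """Return every replica keyed by its warp factor."""
    return {factor: linear_time_warp(samples, factor) for factor in WARP_FACTORS}

# Usage on a 100-sample ramp.
replicas = warped_set([float(i) for i in range(100)])
```

If the broadcast runs at a constant speed error within this range, one of the five replicas should line up closely with the stored reference.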

Next, as depicted by step S270 in FIGURE 9, a confirming classification is carried out between the time
warped (and un-warped) waveforms and the reference patterns in the second stage reference library
according to the queueing order established in step S170.

Generally, this confirmation classification is carried out in accordance with the teachings of the Kenyon
et al patent incorporated herein by reference. In brief, a correlation is a mathematical means of measuring
how well two waveforms compare. If one of the two waveforms is of finite length and is permanently stored
in an appropriate memory storage device, a running comparison of the finite stored waveform against a
second continuous waveform may be accomplished through on-line solution of the correlation integral
equation to produce a third waveform known as the correlation function. When the continuous waveform
contains a signal segment (which may even be obscured by noise) which matches the stored waveform, the
correlation function attains a large value. The sensing of this large value constitutes a recognition, and is the
process by which the occurrence of a commercial advertisement or song is recognized in the process
according to the present invention.

The pattern matching according to the present invention involves a correlation procedure with a
minimum of 50% overlap of input data and zero filling of reference patterns so that linear correlations are
generated instead of circular correlations. The correlation procedure involves computing the cross
correlation function and then scanning it to select the maximum value. This is repeated for each of the time
warped replicas. If the peak correlation value for any of the correlations exceeds a detection threshold, a
recognition is declared and the classification process for that song is over. The confirmatory classification
process will now be described with reference to FIGURES 8 and 13.

In FIGURE 8, the digitized broadcast waveform may be one of the time warped (or un-warped)
waveforms generated in step S210. For example, this digitized broadcast may represent 512 samples of the
audio signal input, taken over a 64 second period. Next, a normalized reference pattern from a second
stage reference library is matched to an arbitrary portion on the digitized broadcast waveform. Note that
only the first half of the normalized reference contains a waveform, the second half being zero filled. Zero
filling is used to accomplish segment splicing which takes the newest reference block and concatenates it
to the previous block. Next, a cross correlation is carried out between the digitized broadcast and the
normalized reference to provide a correlation function, as depicted in FIGURE 8. Note that correlation peak
CP indicates a high correlation between the digitized broadcast and the normalized reference at that
particular point.

The correlation process is carried out by first computing the Fourier transform of all five time warped
and un-warped waveforms. This provides complex conjugate spectra which are compared with the second
stage reference patterns. The reference patterns themselves have been previously normalized so that no
additional normalization is required for the reference patterns. Next, samples from the digitized broadcast
and reference waveform are cross multiplied and inverse Fourier transformed to provide the correlation
signal depicted in FIGURE 8. Note that the correlation function in the zero filled half of the normalized
reference waveform is minimal. Thus, only correlations in the first half of the correlation function are valid.
The second half of the correlation function is generated by taking the correlation waveform and essentially
reversing it to get a mirror image of the reference waveform. The above-described cross correlation process
is depicted at step S211 in FIGURE 14.
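Correlation by Fourier transform with a zero-filled reference may be sketched as follows. The sketch uses a small recursive radix-2 FFT written for the example (not the array processor's routine), omits the normalization of the input, and returns only the lags that are free of circular wrap-around; the function names are hypothetical.

```python
import cmath

def fft(values, inverse=False):
    """Recursive radix-2 FFT; the length must be a power of two. Unscaled."""
    n = len(values)
    if n == 1:
        return list(values)
    if n % 2:
        raise ValueError("length must be a power of two")
    even = fft(values[0::2], inverse)
    odd = fft(values[1::2], inverse)
    sign = 1.0 if inverse else -1.0
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out

def cross_correlate(broadcast, reference):
    """Linear cross correlation of a reference against a longer broadcast block."""
    n = len(broadcast)
    padded = list(reference) + [0.0] * (n - len(reference))   # zero filling
    spectrum_b = fft(broadcast)
    spectrum_r = fft(padded)
    product = [b * r.conjugate() for b, r in zip(spectrum_b, spectrum_r)]
    correlation = fft(product, inverse=True)
    valid = n - len(reference) + 1          # lags with no wrap-around
    return [value.real / n for value in correlation[:valid]]

# Usage: an 8-sample pattern hidden at offset 5 in a 16-sample block.
pattern = [1.0, -2.0, 3.0, -1.0, 2.0, -3.0, 1.0, 2.0]
block = [0.0] * 5 + pattern + [0.0] * 3
function = cross_correlate(block, pattern)
peak_lag = max(range(len(function)), key=lambda lag: function[lag])
```

Because the second half of the padded reference is zero, the retained lags equal the ordinary sliding dot product, which is why the text treats only part of the correlation function as valid.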

Next, the correlation functions between each second stage reference pattern and the plurality of
time-warped (and un-warped) input signals are compared to select the maximum correlation value for the
current input song, as depicted in step S213. The appropriate waveform with the highest correlation value is
selected and compared to a threshold value which determines recognition, as depicted in step S215. As
soon as a correlation peak value is determined to be above the pre-set threshold, a victory is declared and
it is determined that the song has been "recognized". Then, the time of detection, the date, the
broadcasting station, and the song number may be derived and provided to output peripheral equipment.
Such decision logging may be carried out as step S300, depicted in FIGURE 9. This completes the second
stage classification process.
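The decision logic of steps S213 and S215 may be sketched as follows, assuming the peak correlation of every warped replica has already been computed for each queued song. The data layout and names are hypothetical.

```python
def confirm_classification(queue, peaks_by_song, threshold):
    """Walk the queue in order and stop at the first song whose best peak passes.

    queue          -- song numbers in ascending order of first stage distance
    peaks_by_song  -- song number -> peak correlation of each warped replica
    """
    for song in queue:
        best_peak = max(peaks_by_song[song])       # step S213
        if best_peak > threshold:                  # step S215
            return song, best_peak
    return None

# Usage: song 5 is tried after song 3 and is the first to exceed the threshold.
peaks = {3: [0.31, 0.35, 0.40, 0.33, 0.30],
         5: [0.52, 0.71, 0.93, 0.68, 0.49],
         7: [0.95, 0.90, 0.88, 0.80, 0.70]}
result = confirm_classification([3, 5, 7], peaks, threshold=0.85)
```

Stopping at the first passing song is what lets the queue order shorten the second stage: later candidates are never correlated at all.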

Thus, the above-described system and method provides for an accurate, reliable, yet inexpensive
system for classifying broadcast information. Those of skill in this field will understand that a wide variety of
modifications may be made without departing from the spirit and scope of the subject invention.

An additional advantage of the apparatus according to the present invention is that it may be used to
generate the first and second stage reference libraries. This is an automatic training procedure in which
broadcast songs are "played into" the system to provide the first and second stage reference patterns. The
automatic training procedure first selects the most spectrally distinctive section of the song for use in
reference pattern generation. Certain sections of a recording are more spectrally distinctive than others.
When such a section is used to generate a reference pattern, the performance of a pattern recognizer is
improved since it operates on more information. A measure of the distinctiveness of a portion of a recording
is the bandwidth of this feature vector. This can be estimated as follows:



$$B = \left[\sum_{w=0}^{W} X(w)\right]^{2} \Big/ \left[\sum_{w=0}^{W} X^{2}(w)\right] \qquad (10)$$



where X(w) represents the power spectral density at any particular frequency. Very large bandwidths can be
produced by songs with impulsive features. While these features are distinctive, they are more subject to
distortion and require greater processor dynamic range.
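Reading the bandwidth measure as the squared sum of the power spectral density divided by the sum of its squares, it may be computed as in the sketch below. That reading is an interpretation of the printed formula, and the function name is hypothetical.

```python
def spectral_bandwidth(power_spectrum):
    """Squared sum over sum of squares of the power spectral density values."""
    total = sum(power_spectrum)
    squares = sum(p * p for p in power_spectrum)
    if squares == 0:
        raise ValueError("power spectrum is all zero")
    return total * total / squares

# Usage: a flat spectrum over 8 bins scores 8, a single spectral line scores 1.
flat = spectral_bandwidth([1.0] * 8)
line = spectral_bandwidth([0.0, 0.0, 5.0, 0.0])
```

Under this reading the measure counts, in effect, how many frequency bins carry a meaningful share of the power.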

Areas containing impulsive features may be located by computing the crest factor as a ratio of the peak
feature value to the standard deviation of the feature vector computed in the same region. A composite
figure of merit is then computed as either the ratio of, or the difference between, the bandwidth and crest
factor of the second stage feature vector. This is repeated in small increments of time (for example one
second) throughout the song. At positions where the figure of merit is highest, a first stage reference feature
matrix is computed and tested for time alignment sensitivity. For timing errors as large as can be
encountered due to offsets and time scale errors, the resultant distance must remain below a threshold
value. The position in the song for training is selected as the one with the highest second stage figure of
merit that also passes the first stage time sensitivity test. Those of skill in this field will appreciate that the
same hardware used to conduct music classification can also be used to generate the first and second
stage reference libraries.
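The selection of a training position may be sketched as follows. The sketch assumes the population standard deviation and the largest absolute value as the "peak", and takes the result of the first stage time sensitivity test as a precomputed flag; all names and the candidate tuple layout are hypothetical.

```python
import statistics

def crest_factor(features):
    """Peak feature value divided by the standard deviation of the region."""
    spread = statistics.pstdev(features)
    if spread == 0:
        raise ValueError("feature vector has no variation")
    return max(abs(value) for value in features) / spread

def figure_of_merit(bandwidth, crest, use_ratio=True):
    """Composite merit: ratio of, or difference between, bandwidth and crest factor."""
    return bandwidth / crest if use_ratio else bandwidth - crest

def select_training_position(candidates):
    """Pick the highest-merit position that also passes the alignment test.

    candidates -- (position_seconds, bandwidth, crest, passes_alignment_test)
    """
    passing = [c for c in candidates if c[3]]
    if not passing:
        return None
    return max(passing, key=lambda c: figure_of_merit(c[1], c[2]))[0]

# Usage: the 20 s position has the best merit but fails the alignment test.
impulsive = crest_factor([0.0, 0.0, 0.0, 4.0])
chosen = select_training_position([(10, 8.0, 2.0, True),
                                   (20, 12.0, 2.0, False),
                                   (30, 9.0, 3.0, True)])
```

Dividing (or subtracting) by the crest factor penalizes sections whose large bandwidth comes mainly from impulsive events, which the text identifies as the ones most vulnerable to distortion.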

Therefore, what has been described above is apparatus and method for automatically classifying
broadcast information according to stored reference patterns. Since the system is microprocessor based, it
can be realized in an extremely small and economical package. The costs of constructing and installing
such systems will be economically advantageous. Those of skill in this field will readily understand the
advantages achieved by the structure and functions of the above-described invention.

While the present invention has been described in connection with what is presently considered to be 
the most practical and preferred embodiments, it is to be understood that the invention is not limited to the 
disclosed embodiment. 


Claims 

1. Method of classifying broadcast information, comprising the steps:
receiving broadcast information;
correlating (S270) the broadcast information with a library of second stage reference patterns; and
classifying (S270, S300) said broadcast information as similar to one of said second stage
reference patterns based on said correlating step,
characterized in that
between receiving and correlating, said broadcast information is compared (S150) with a library of first
stage reference patterns, said first stage reference patterns being queued (S170) in an order of their
similarity to said broadcast information; and
that said library of second stage reference patterns corresponds to said first stage reference patterns in
the queueing order established in the queueing step.

2. A method according to claim 1 further including the steps of:

generating a plurality of analyzed waveforms from said broadcast information; 
time warping one of said analyzed waveforms to provide at least one time warped waveform; and 
wherein said correlating step includes the step of correlating both said one analyzed waveform and 
said time warped waveform with said library of second stage reference patterns. 


3. A method according to claim 2 wherein said time warping step includes the step of linearly time
warping said one analyzed waveform to provide a stretched waveform.

4. A method according to claim 1 wherein said receiving step includes the step of simultaneously
receiving a plurality of broadcast information, and wherein said steps of comparing, queueing,
correlating, and classifying are performed on said plurality of broadcast information, substantially
simultaneously.

5. A method according to claim 1 further including the steps of:

processing said broadcast information to provide a plurality N of analyzed signal patterns
corresponding to said broadcast information;
generating a spectragram from said analyzed signal patterns;
providing said spectragram to said comparing step for comparison to said first stage reference
patterns; and
providing one of said analyzed signal patterns to said correlating step for correlation with said
second stage reference patterns.

6. A method according to claim 5 wherein said step of processing includes the steps of:
bandpass filtering said broadcast information into a plurality of bands; and 

computing a plurality of linear combinations of said plurality of bands to provide said plurality of 
analyzed signal patterns. 

7. A method according to claim 6 wherein said step of bandpass filtering includes the steps of bandpass
filtering, rectifying, and then lowpass filtering each said band. 

8. A method according to claim 5 wherein said step of generating a spectragram includes the steps of:
sampling said plurality of analyzed signal patterns at a predetermined rate;
constructing a time/frequency matrix having N frequency channels, n time channels, and a plurality
of N(n) of matrix elements;
calculating, for each matrix element, a matrix value $X_{N,n}$ as follows:

$$X_{N,n}(t) = [K_1 \times V_{An}(t)] + \dots + [K_N \times V_{Nn}(t)]$$

where: t equals time at which samples are taken from said plurality of analyzed signal patterns;
$V_{An}$ through $V_{Nn}$ are amplitude values of first through Nth analyzed signal patterns taken
at sample time t; and
$K_1$ through $K_N$ are constants preselected to minimize the influence of broadband impulsive
energy.

9. A method according to claim 8 further including the step of normalizing said matrix to provide said 
spectragram. 


10. A method according to claim 8 wherein each said first stage reference pattern comprises an Nxn
reference matrix of elements, and wherein said step of comparing comprises the steps of:
measuring a variation between each element of the time/frequency matrix and a corresponding
element of each of said reference matrices;
summing the measured variations between the time/frequency matrix and each reference matrix to
provide a distance measurement for each reference matrix;
comparing each distance measurement with a threshold value; and
eliminating those first stage reference patterns whose corresponding distance measurement
exceeds said threshold value.

11. A method according to claim 10 wherein said queueing step comprises the step of ordering
non-eliminated first stage reference patterns according to their corresponding distance measurements.

12. A method according to claim 1 wherein said correlating step includes the steps of: 

calculating a correlation value for at least one second stage reference pattern with reference to said
broadcast information; and

comparing said correlation value with a threshold correlation value; and 

wherein said classifying step includes the step of classifying said broadcast information as similar 






EP 0 319 567 B1 



to only the second stage reference pattern whose corresponding correlation value exceeds said 
threshold reference value. 

13. A method according to claim 1 further including the training steps of:
(a) analyzing reference broadcast information to identify a spectrally distinctive portion thereof;
(b) determining a figure of merit for said distinctive portion using a peak value and a standard
deviation value from said distinctive portion;
(c) generating a first stage reference pattern from said distinctive portion;
(d) testing the generated first stage reference pattern for time alignment sensitivity;
(e) repeating said steps (a)-(d) when the generated first stage reference pattern does not pass the
time sensitivity test; and
(f) using said spectrally distinctive portion to provide first and second stage reference patterns when
said generated first stage reference pattern passes the time sensitivity test.

14. Apparatus for classifying broadcast information, comprising:
means (2,4) for receiving broadcast information;
processing means (18) for correlating the broadcast information with a library of second stage
reference patterns (34); and
processing means (24;26) for classifying said broadcast information as similar to one of said second
stage reference patterns based on said correlation,
characterized by
processing means (24;26) for comparing said information with a library of first stage reference patterns
(34); and
processing means (24;26) for queueing the first stage reference patterns in an order of their similarity to
said information, said processing means (18) for correlating being supplied with said library of second
stage reference patterns, which correspond to said first stage reference patterns in the queueing order
established.

15. Apparatus according to claim 14 further including means for generating a plurality of analyzed
waveforms from said broadcast information;
and wherein said processing means (24) includes processing means for (a1) time warping one of
said analyzed waveforms to provide at least one time warped waveform; and wherein said processing
means for (c) correlating includes processing means (18) for correlating both said one analyzed
waveform and said time warped waveform with said library of second stage reference patterns.

16. Apparatus according to claim 15 wherein said processing means for (a1) time warping includes 
processing means for linearly time warping said one analyzed waveform to provide a stretched 
waveform. 

17. Apparatus according to claim 14 wherein said means (4) for receiving includes means for
simultaneously receiving a plurality of broadcast information, and wherein said processing means (24)
includes means for performing the processing functions (a)-(d) on said plurality of broadcast information,
substantially simultaneously.

18. Apparatus according to claim 14 wherein said processing means for (a) comparing includes processing
means for (a1) processing said broadcast information to provide a plurality N of analyzed signal
patterns corresponding to said broadcast information; (a2) generating a spectragram from said analyzed
signal patterns; (a3) providing said spectragram to said processing means for (a) comparing for
comparison to said first stage reference patterns; and (a4) providing one of said analyzed signal
patterns to said processing means for (c) correlating for correlation with said second stage reference
patterns.

19. Apparatus according to claim 18 wherein said processing means for (a) comparing includes processing 
means (6) for (a1a) bandpass filtering said broadcast information into a plurality of bands; and (a1b)
computing a plurality of linear combinations of said plurality of bands to provide said plurality of
analyzed signal patterns. 

20. Apparatus according to claim 19 wherein said means (6) for (a1a) bandpass filtering includes means for
rectifying (10), and then lowpass filtering (12) each said band.
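The rectify-then-lowpass chain of claim 20 amounts to extracting an amplitude envelope from each band. The sketch below assumes an already bandpass-filtered band as input, full-wave rectification, and a one-pole smoothing filter; the name `envelope` and the smoothing coefficient `alpha` are hypothetical and do not come from the claims.

```python
def envelope(band, alpha=0.1):
    """Full-wave rectify `band`, then smooth it with a one-pole lowpass filter.

    Assumption: `band` is one output of the bandpass filter bank; `alpha`
    (0 < alpha <= 1) sets how quickly the envelope follows the signal.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    smoothed = []
    state = 0.0
    for sample in band:
        rectified = abs(sample)               # rectifying step
        state += alpha * (rectified - state)  # lowpass filtering step
        smoothed.append(state)
    return smoothed


env = envelope([1.0, -1.0, 1.0, -1.0], alpha=0.5)
```

The alternating input has zero mean, yet its envelope rises steadily toward 1, which is the slowly varying energy measure that the later stages work with.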

21. Apparatus according to claim 18 wherein said processing means for (a2) generating a spectragram
includes processing means for (a2a) sampling said plurality of analyzed signal patterns at a predetermined
rate; (a2b) constructing a time/frequency matrix having N frequency channels, n time channels,
and a plurality N(n) of matrix elements;
(a2c) calculating, for each matrix element, a matrix value $X_{N,n}$ as follows:

$$X_{N,n}(t) = [K_1 \times V_{An}(t)] + \dots + [K_N \times V_{Nn}(t)]$$

where: t equals time at which samples are taken from said plurality of analyzed signal patterns;
$V_{An}$ through $V_{Nn}$ are amplitude values of first through Nth analyzed signal patterns taken
at sample time t; and
$K_1$ through $K_N$ are constants preselected to minimize the influence of broadband impulsive
energy.

22. Apparatus according to claim 21 wherein said processing means for (a2) generating a spectragram 
includes processing means for normalizing said matrix to provide said spectragram. 
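The matrix construction of claim 21 and the normalization of claim 22 can be sketched as a weighted sum per matrix element followed by a rescaling. Everything concrete here is an assumption made for illustration: the helper names, the layout `samples[t][i]` (amplitude of pattern i at sample time t), one weight row per frequency channel, and peak normalization (the claims do not say how the matrix is normalized).

```python
def matrix_value(weights, amplitudes):
    """Weighted sum K_1*V_1 + ... + K_N*V_N for one sample time."""
    if len(weights) != len(amplitudes):
        raise ValueError("need one weight per analyzed signal pattern")
    return sum(k * v for k, v in zip(weights, amplitudes))


def build_matrix(weight_rows, samples):
    """Return matrix[channel][t], one row per (assumed) set of constants."""
    return [[matrix_value(w, s) for s in samples] for w in weight_rows]


def normalize(matrix):
    """Scale so the largest magnitude is 1 (assumed normalization rule)."""
    peak = max((abs(x) for row in matrix for x in row), default=0.0)
    if peak == 0.0:
        return [list(row) for row in matrix]
    return [[x / peak for x in row] for row in matrix]


weight_rows = [[1.0, -1.0], [0.5, 0.5]]   # hypothetical constants
samples = [[2.0, 1.0], [4.0, 0.0]]        # two patterns, two sample times
matrix = build_matrix(weight_rows, samples)
spectragram = normalize(matrix)
```

A weight row that sums to zero, such as `[1.0, -1.0]`, cancels any energy that is equal in both inputs, which is one plausible way constants could reduce the influence of broadband impulsive energy.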


23. Apparatus according to claim 21 wherein each of said first stage reference patterns comprises an N x n
reference matrix of elements, and wherein said processing means for (a) comparing includes processing
means for (a5) measuring a variation between each element of the time/frequency matrix and a
corresponding element of each of said reference matrices; (a6) summing the measured variations
between the time/frequency matrix and each reference matrix to provide a distance measurement for
each reference matrix; (a7) comparing each distance measurement with a threshold value; and (a8)
eliminating those first stage reference patterns whose corresponding distance measurement exceeds
said threshold value.

24. Apparatus according to claim 23 wherein said processing means for (b) queueing includes processing
means for queueing non-eliminated first stage reference patterns according to their corresponding 
distance measurements. 
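The first stage of claims 23 and 24 (measure, sum, threshold, queue) can be sketched as follows. This is an illustration under assumptions: the "variation" between elements is taken to be the absolute difference, reference patterns are held in a dictionary keyed by a hypothetical identifier, and the names `distance` and `queue_candidates` are invented for the example.

```python
def distance(matrix, reference):
    """Sum of element-wise absolute differences between two N x n matrices."""
    return sum(
        abs(a - b)
        for row_m, row_r in zip(matrix, reference)
        for a, b in zip(row_m, row_r)
    )


def queue_candidates(matrix, references, threshold):
    """Drop references whose distance exceeds `threshold`; queue the rest.

    Returns reference identifiers ordered from most to least similar.
    """
    scored = [(distance(matrix, ref), ref_id) for ref_id, ref in references.items()]
    kept = [(d, ref_id) for d, ref_id in scored if d <= threshold]
    kept.sort()  # smallest distance first
    return [ref_id for _, ref_id in kept]


observed = [[0.0, 1.0], [1.0, 0.0]]
library = {
    "b": [[1.0, 1.0], [1.0, 1.0]],
    "a": [[0.0, 1.0], [1.0, 0.0]],
    "c": [[5.0, 5.0], [5.0, 5.0]],
}
queue = queue_candidates(observed, library, threshold=3.0)
```

Only the surviving identifiers, in this order, would be handed to the more expensive second stage correlation.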

25. Apparatus according to claim 14 wherein said processing means for (c) correlating includes processing 
means for (c1) calculating a correlation value for at least one second stage reference pattern with
reference to said broadcast information; and (c2) comparing said correlation value with a threshold
correlation value; and wherein said processing means for (d) classifying includes processing means for 
classifying said broadcast information as similar to only the second stage reference pattern whose 
corresponding correlation value exceeds said threshold reference value. 
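The second stage decision of claim 25 can be sketched as walking the queue and accepting a reference only when its correlation value exceeds the threshold. The correlation measure used here (a zero-lag normalized correlation coefficient) is an assumption, as are the names `correlation` and `classify` and the dictionary of second stage patterns; the claim itself only requires a correlation value and a threshold comparison.

```python
import math


def correlation(x, y):
    """Zero-lag normalized correlation coefficient of two equal-length waveforms."""
    n = len(x)
    if n == 0 or n != len(y):
        raise ValueError("waveforms must be non-empty and of equal length")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    num = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    den = math.sqrt(
        sum((a - mean_x) ** 2 for a in x) * sum((b - mean_y) ** 2 for b in y)
    )
    return num / den if den else 0.0


def classify(waveform, queue, second_stage, threshold):
    """Return the first queued reference whose correlation exceeds `threshold`."""
    for ref_id in queue:
        if correlation(waveform, second_stage[ref_id]) > threshold:
            return ref_id
    return None  # nothing in the queue was similar enough


second_stage = {"a": [1.0, 0.0, -1.0, 0.0], "b": [0.0, 2.0, 0.0, -2.0]}
result = classify([0.0, 1.0, 0.0, -1.0], ["a", "b"], second_stage, threshold=0.9)
```

Because the queue is ordered by first stage similarity, the most likely candidates are correlated first, and the search can stop at the first value above the threshold.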


26. Apparatus according to claim 14 wherein said processing means includes processing means for (e) 
analyzing said broadcast information to identify a spectrally distinctive portion thereof; (f) determining a 
figure of merit for said distinctive portion using a peak value and a peak value standard deviation from 
said distinctive portion; (g) generating a first stage reference pattern from said distinctive portion; (h)
testing the generated first stage reference pattern for time alignment sensitivity; (i) repeating the
functions (e)-(h) when the generated first stage reference pattern does not pass the time sensitivity test; 
and (j) using said spectrally distinctive portion to provide first and second stage reference patterns 
when said generated first stage reference pattern passes the time sensitivity test. 

Patentansprüche (claims, German text)

1. Method of classifying broadcast information, comprising the steps of:
receiving broadcast information;

correlating (S270) the broadcast information with a library of second stage reference patterns;

and classifying (S270, S300), on the basis of said correlating, said broadcast information as similar
to one of the second stage reference patterns,
characterized in that,
between the receiving and the correlating, said broadcast information is compared (S150) with a library
of first stage reference patterns, said first stage reference patterns being queued (S170) in an order
of their similarity to said broadcast information; and
in that said library of second stage reference patterns corresponds to the first stage reference
patterns in the queueing order established in the queueing step.

2. Method according to claim 1, further including the steps of:

generating a plurality of analyzed waveforms from said broadcast information;
time warping one of said analyzed waveforms to provide at least one time warped waveform;

wherein the correlating step includes correlating both the one analyzed waveform and said time
warped waveform with the library of second stage reference patterns.

3. Method according to claim 2,

wherein the time warping step includes linearly time warping said one analyzed waveform to provide
a stretched waveform.

4. Method according to claim 1,
wherein the receiving step includes simultaneously receiving a plurality of broadcast information,
and wherein the steps of comparing, queueing, correlating and classifying are performed on said
plurality of broadcast information substantially simultaneously.

5. Method according to claim 1, further including the steps of:

processing the broadcast information to provide a plurality (N) of analyzed signal patterns
corresponding to said broadcast information;

generating a spectragram from said analyzed signal patterns;
using said spectragram in the comparing step for comparison to the first stage reference patterns; and

using one of said analyzed signal patterns in the correlating step for correlation with the second
stage reference patterns.

6. Method according to claim 5,
wherein said processing step includes the steps of:
bandpass filtering said broadcast information into a plurality of bands; and

computing a plurality of linear combinations of said plurality of bands to provide the plurality of
analyzed signal patterns.

7. Method according to claim 6,

wherein said bandpass filtering step includes bandpass filtering, rectifying, and then lowpass
filtering each band.

8. Method according to claim 5,
wherein the step of generating a spectragram includes the steps of:

sampling a plurality of analyzed signal patterns at a predetermined sampling rate;
constructing a time/frequency matrix having N frequency channels, n time channels, and a plurality
N(n) of matrix elements;
calculating, for each matrix element, a matrix value $X_{N,n}$ as follows:

$$X_{N,n}(t) = [K_1 \times V_{An}(t)] + \dots + [K_N \times V_{Nn}(t)]$$

where:

t is the time at which the samples of said plurality of analyzed signal patterns were taken;

$V_{An}$ through $V_{Nn}$ are the amplitude values of the first through Nth analyzed signal patterns
sampled at sample time t; and
$K_1$ through $K_N$ are preselected constants by which the influence of broadband impulsive
interference energy is minimized.

9. Method according to claim 8,

further including normalizing said matrix to provide the spectragram.

10. Method according to claim 8,

wherein each of the first stage reference patterns comprises an N x n reference matrix of elements,
and wherein the comparing comprises the steps of:
measuring a variation between each element of the time/frequency matrix and a corresponding
element of each of said reference matrices;

summing the measured variations between the time/frequency matrix and each reference matrix to
provide a distance measurement for each reference matrix;
comparing each distance measurement with a threshold value; and
eliminating those first stage reference patterns whose corresponding distance measurement exceeds
said threshold value.

11. Method according to claim 10,

wherein the queueing step includes the step of queueing non-eliminated first stage reference
patterns according to their distance measurements.

12. Method according to claim 1,

wherein the correlating includes the steps of: calculating a correlation value for at least one
second stage reference pattern with reference to the broadcast information; and
comparing said correlation value with a threshold correlation value;

wherein said classifying step includes classifying said broadcast information as similar to only that
second stage reference pattern whose corresponding correlation value exceeds said threshold
reference value.

13. Method according to claim 1,

further including the following training steps:

a) analyzing reference broadcast information to identify a spectrally distinctive portion thereof;

b) determining a figure of merit for said distinctive portion using a peak value and a standard
deviation value from said distinctive portion;

c) generating a first stage reference pattern from said distinctive portion;

d) testing the first stage reference pattern for time alignment sensitivity;

e) repeating said steps a) to d) when the generated first stage reference pattern does not pass the
time sensitivity test; and

f) using the spectrally distinctive portion to provide first and second stage reference patterns
when the generated first stage reference pattern passes the time sensitivity test.

14. Apparatus for classifying broadcast information, comprising:
means (2,4) for receiving broadcast information;

processing means (18) for correlating the broadcast information with a library of second stage
reference patterns (34); and

processing means (24,26) for classifying said broadcast information as similar to one of said second
stage reference patterns based on said correlation;
characterized by

processing means (24,26) for comparing said information with a library of first stage reference
patterns (34), and processing means (24;26) for queueing the first stage reference patterns in an
order of their similarity to said information, the processing means (18) for correlating being
supplied with the library of second stage reference patterns, which correspond to the first stage
reference patterns in the queueing order established.

15. Apparatus according to claim 14,
further including means for generating a plurality of analyzed waveforms from the broadcast
information;

and wherein said processing means (24) includes processing means for (a1) time warping one of said
analyzed waveforms to provide at least one time warped waveform, and wherein said processing means
for (c) correlating includes processing means (18) for correlating both said one analyzed waveform
and the time warped waveform with said library of second stage reference patterns.

16. Apparatus according to claim 15,

wherein said processing means for (a1) time warping includes processing means for linearly time
warping said one analyzed waveform to provide a stretched waveform.

17. Apparatus according to claim 14,

wherein said means (4) for receiving includes means for simultaneously receiving a plurality of
broadcast information, and wherein said processing means (24) includes means for performing the
processing functions/steps (a)-(d), which are performed on the plurality of broadcast information
substantially simultaneously.

18. Apparatus according to claim 14,

wherein said processing means for (a) comparing includes processing means for

(a1) processing said broadcast information to provide a plurality N of analyzed signal patterns
corresponding to said broadcast information;

(a2) generating a spectragram from said analyzed signal patterns;

(a3) providing said spectragram to said processing means for (a) comparing for comparison to the
first stage reference patterns; and

(a4) providing one of the analyzed signal patterns to the processing means for (c) correlating for
correlation with the second stage reference patterns.

19. Apparatus according to claim 18,

wherein said processing means for (a) comparing includes processing means (6) for
(a1a) bandpass filtering said broadcast information into a plurality of bands; and

(a1b) computing a plurality of linear combinations of said plurality of bands to provide the
plurality of analyzed signal patterns.

20. Apparatus according to claim 19,

wherein said means (6) for (a1a) bandpass filtering includes means for rectifying (10) and then
lowpass filtering (12) each band.

21. Apparatus according to claim 18,

wherein said processing means for (a2) generating a spectragram includes processing means for

(a2a) sampling said plurality of analyzed signal patterns at a predetermined sampling rate;

(a2b) constructing a time/frequency matrix having N frequency channels, n time channels, and a
plurality N(n) of matrix elements; and

(a2c) calculating a matrix value $X_{N,n}$ for each matrix element as follows:

$$X_{N,n}(t) = [K_1 \times V_{An}(t)] + \dots + [K_N \times V_{Nn}(t)]$$

where:

t is the time at which the samples of said plurality of analyzed signal patterns were taken;

$V_{An}$ through $V_{Nn}$ are the amplitude values of the first through Nth analyzed signal patterns
sampled at sample time t; and
$K_1$ through $K_N$ are preselected constants by which the influence of broadband impulsive
interference energy is minimized.

22. Apparatus according to claim 21,

wherein the processing means for (a2) generating a spectragram includes processing means for
normalizing said matrix to form the spectragram.

23. Apparatus according to claim 21,

wherein each of the first stage reference patterns comprises an N x n reference matrix of elements,
and wherein the processing means for (a) comparing includes processing means for

(a5) measuring a variation between each element of the time/frequency matrix and a corresponding
element of each of said reference matrices;

(a6) summing the measured variations between the time/frequency matrix and each reference matrix to
provide a distance measurement for each reference matrix;

(a7) comparing each distance measurement with a threshold value; and
(a8) eliminating those first stage reference patterns whose corresponding distance measurement
exceeds said threshold value.

24. Apparatus according to claim 23,

wherein the processing means for (b) queueing includes processing means for queueing non-eliminated
first stage reference patterns according to their distance values.

25. Apparatus according to claim 14,

wherein the processing means for (c) correlating includes processing means for

(c1) calculating a correlation value for at least one second stage reference pattern with reference
to the broadcast information; and

(c2) comparing said correlation value with a threshold correlation value; and
wherein said (d) classifying includes classifying said broadcast information as similar to only that
second stage reference pattern whose corresponding correlation value exceeds said threshold
reference value.

26. Apparatus according to claim 14,

wherein said processing means includes processing means for
(e) analyzing said broadcast information to identify a spectrally distinctive portion thereof;

(f) determining a figure of merit for said distinctive portion using a peak value and a peak value
standard deviation from said distinctive portion;

(g) generating a first stage reference pattern from said distinctive portion;
(h) testing the first stage reference pattern for time alignment sensitivity;

(i) repeating said functions (e)-(h) when the generated first stage reference pattern does not pass
the time sensitivity test; and

(j) using the spectrally distinctive portion to provide first and second stage reference patterns
when the generated first stage reference pattern passes the time sensitivity test.

Revendications (claims, French text)

1. Method of classifying broadcast information, comprising the steps of:
receiving broadcast information;

correlating (S270) the broadcast information with a library of second stage reference patterns; and

classifying (S270, S300) said broadcast information as similar to one of said second stage reference
patterns on the basis of said correlating step;
characterized in that:

between the receiving and the correlating, said broadcast information is compared (S150) with a
library of first stage reference patterns, said first stage reference patterns being queued (S170)
in an order corresponding to their similarity to said broadcast information; and

in that said library of second stage reference patterns corresponds to said first stage reference
patterns in the queueing order established in the queueing step.

2. Method according to claim 1, further including the steps of:
generating a plurality of analyzed waveforms from said broadcast information;

time warping one of said analyzed waveforms to provide at least one time warped waveform; and

wherein said correlating step includes the step of correlating both said one analyzed waveform and
said time warped waveform with said library of second stage reference patterns.

3. Method according to claim 2, wherein said time warping step includes the step of linearly time
warping said analyzed waveform to provide a stretched waveform.

4. Method according to claim 1 wherein said receiving step includes the step of simultaneously
receiving a plurality of broadcast information, and wherein said steps of comparing, queueing,
correlating and classifying are performed on said plurality of broadcast information,
substantially simultaneously.

5. Method according to claim 1, further including the steps of:

processing said broadcast information to provide a plurality N of analyzed signal patterns
corresponding to said broadcast information;

generating a spectragram from said analyzed signal patterns;
providing said spectragram to said comparing step for comparison to said first stage reference
patterns; and

providing one of said analyzed signal patterns to said correlating step for correlation with said
second stage reference patterns.

6. Method according to claim 5 wherein said processing step includes the steps of:
bandpass filtering said broadcast information into a plurality of bands; and
computing a plurality of linear combinations of said plurality of bands to provide said
plurality of analyzed signal patterns.

7. Method according to claim 6, wherein said bandpass filtering step includes the steps of
bandpass filtering, rectifying, and then lowpass filtering each said band.

8. Method according to claim 5, wherein said step of generating a spectragram includes the
steps of:

sampling said plurality of analyzed signal patterns at a predetermined rate;

constructing a time/frequency matrix having N frequency channels, n time channels, and a
plurality N(n) of matrix elements;

calculating, for each matrix element, a matrix value $X_{N,n}$ as follows:

$$X_{N,n}(t) = [K_1 \times V_{An}(t)] + \dots + [K_N \times V_{Nn}(t)]$$

where: t is a time at which samples are taken from said plurality of analyzed signal patterns;
$V_{An}$ through $V_{Nn}$ are amplitude values of the first through Nth analyzed signal patterns
taken at a sample time t; and
$K_1$ through $K_N$ are constants preselected to minimize the influence of broadband impulsive
energy.

9. Method according to claim 8, further including the step of normalizing said matrix to
provide said spectragram.

10. Method according to claim 8, wherein each said first stage reference pattern comprises an
N x n reference matrix of elements, and wherein said comparing step comprises the steps of:

measuring a variation between each element of the time/frequency matrix and a corresponding
element of each of said reference matrices;

summing the measured variations between the time/frequency matrix and each reference matrix to
provide a distance measurement for each reference matrix;
comparing each distance measurement with a threshold value; and
eliminating those first stage reference patterns whose corresponding distance measurement
exceeds said threshold value.

11. Method according to claim 10, wherein said queueing step comprises the step of ordering
non-eliminated first stage reference patterns according to their corresponding distance
measurements.

12. Method according to claim 1, wherein said correlating step includes the steps of:

calculating a correlation value for at least one second stage reference pattern with
reference to said broadcast information; and

comparing said correlation value with a threshold correlation value; and

wherein said classifying step includes the step of classifying said broadcast information as
similar to only the second stage reference pattern whose corresponding correlation value
exceeds said threshold reference value.

13. Method according to claim 1, further including the following training steps:

(a) analyzing reference broadcast information to identify a spectrally distinctive portion
thereof;

(b) determining a figure of merit for said distinctive portion using a peak value and a
standard deviation value from said distinctive portion;

(c) generating a first stage reference pattern from said distinctive portion;

(d) testing the generated first stage reference pattern for time alignment sensitivity;

(e) repeating said steps (a) to (d) when the generated first stage reference pattern does not
pass the time sensitivity test; and

(f) using said spectrally distinctive portion to provide first and second stage reference
patterns when said generated first stage reference pattern passes the time sensitivity test.

14. Apparatus for classifying broadcast information, comprising:
means (2, 4) for receiving broadcast information;

processing means (18) for correlating the broadcast information with a library of second stage
reference patterns (34); and

processing means (24; 26) for classifying said broadcast information as similar to one of said
second stage reference patterns on the basis of said correlation;
characterized by:

processing means (24; 26) for comparing said information with a library of first stage
reference patterns (34); and

processing means (24; 26) for queueing the first stage reference patterns in an order of their
similarity to said information, said processing means (18) for correlating being supplied with
said library of second stage reference patterns, which correspond to said first stage reference
patterns in the queueing order established.

15. Apparatus according to claim 14, further including means for generating a plurality of
analyzed waveforms from said broadcast information;

and wherein said processing means (24) includes processing means for (a1) time warping one of
said analyzed waveforms to provide at least one time warped waveform, and wherein said
processing means for (c) correlating includes processing means (18) for correlating both said
one analyzed waveform and said time warped waveform with said library of second stage
reference patterns.

16. Apparatus according to claim 15, wherein said processing means for (a1) time warping
includes processing means for linearly time warping said one analyzed waveform to provide a
stretched waveform.

17. Apparatus according to claim 14, wherein said means (4) for receiving includes means for
simultaneously receiving a plurality of broadcast information, and wherein said processing
means (24) includes means for performing the processing functions (a) to (d) on said plurality
of broadcast information, substantially simultaneously.

18. Apparatus according to claim 14, wherein said processing means for (a) comparing includes
processing means for (a1) processing said broadcast information to provide a plurality N of
analyzed signal patterns corresponding to said broadcast information; (a2) generating a
spectragram from said analyzed signal patterns; (a3) providing said spectragram to said
processing means for (a) comparing for comparison to said first stage reference patterns; and
(a4) providing one of said analyzed signal patterns to said processing means for (c)
correlating for correlation with said second stage reference patterns.

19. Apparatus according to claim 18, wherein said processing means for (a) comparing includes
processing means (6) for (a1a) bandpass filtering said broadcast information into a plurality
of bands; and (a1b) computing a plurality of linear combinations of said plurality of bands to
provide said plurality of analyzed signal patterns.

20. Apparatus according to claim 19, wherein said means (6) for (a1a) bandpass filtering
includes means for rectifying (10) and then lowpass filtering (12) each said band.

21. Apparatus according to claim 18, wherein said processing means for (a2) generating a
spectragram includes processing means for (a2a) sampling said plurality of analyzed signal
patterns at a predetermined rate; (a2b) constructing a time/frequency matrix having N
frequency channels, n time channels, and a plurality N(n) of matrix elements; and (a2c)
calculating, for each matrix element, a matrix value $X_{N,n}$ as follows:

$$X_{N,n}(t) = [K_1 \times V_{An}(t)] + \dots + [K_N \times V_{Nn}(t)]$$

where: t is a time at which samples are taken from said plurality of analyzed signal patterns;

$V_{An}$ through $V_{Nn}$ are amplitude values of the first through Nth analyzed signal patterns
taken at a sample time t; and

$K_1$ through $K_N$ are constants preselected to minimize the influence of broadband impulsive
energy.

22. Apparatus according to claim 21, wherein said processing means for (a2) generating a
spectragram includes processing means for normalizing said matrix to provide said
spectragram.

23. Apparatus according to claim 21, wherein each of said first stage reference patterns
comprises an N x n reference matrix of elements, and wherein said processing means for (a)
comparing includes processing means for (a5) measuring a variation between each element of
the time/frequency matrix and a corresponding element of each of said reference matrices;
(a6) summing the measured variations between the time/frequency matrix and each reference
matrix to provide a distance measurement for each reference matrix; (a7) comparing each
distance measurement with a threshold value; and (a8) eliminating those first stage reference
patterns whose corresponding distance measurement exceeds said threshold value.

24. Apparatus according to claim 23, wherein said processing means for (b) queueing includes
processing means for queueing the non-eliminated first stage reference patterns according to
their corresponding distance measurements.

25. Apparatus according to claim 14, wherein said processing means for (c) correlating
includes processing means for (c1) calculating a correlation value for at least one second
stage reference pattern with reference to said broadcast information; and (c2) comparing said
correlation value with a threshold correlation value; and wherein said processing means for
(d) classifying includes processing means for classifying said broadcast information as
similar to only the second stage reference pattern whose corresponding correlation value
exceeds said threshold reference value.

26. Apparatus according to claim 14, wherein said processing means includes processing means
for (e) analyzing said broadcast information to identify a spectrally distinctive portion
thereof; (f) determining a figure of merit for said distinctive portion using a peak value and
a peak value standard deviation from said distinctive portion; (g) generating a first stage
reference pattern from said distinctive portion; (h) testing the generated first stage
reference pattern for time alignment sensitivity; (i) repeating the functions (e) to (h) when
the generated first stage reference pattern does not pass the time sensitivity test; and (j)
using said spectrally distinctive portion to provide first and second stage reference
patterns when said generated first stage reference pattern passes the sensitivity test.